Overview

Dataset info

Number of variables48
Number of observations101763
Missing cells4073 (0.1%)
Duplicate rows0 (0.0%)
Total size in memory37.3 MiB
Average record size in memory384.0 B

Variables types

Numeric14
Categorical30
Boolean1
Date0
URL0
Text (Unique)0
Rejected3
Unsupported0

Warnings

citoglipton has constant value "No" Rejected
diag_1 has a high cardinality: 717 distinct values Warning
diag_2 has a high cardinality: 749 distinct values Warning
diag_3 has a high cardinality: 790 distinct values Warning
diag_3 has 1423 (1.4%) missing values Missing
encounter_id is highly correlated with df_index (ρ = 0.9678103131) Rejected
examide has constant value "No" Rejected
num_procedures has 46652 (45.8%) zeros Zeros
number_emergency is highly skewed (γ1 = 22.85527244) Skewed
number_emergency has 90380 (88.8%) zeros Zeros
number_inpatient has 67627 (66.5%) zeros Zeros
number_outpatient has 85024 (83.6%) zeros Zeros
race has 2271 (2.2%) missing values Missing

Variables

a1cresult
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
None
84745
>8
 
8216
Norm
 
4990
ValueCountFrequency (%) 
None 84745 83.3%
 
>8 8216 8.1%
 
Norm 4990 4.9%
 
>7 3812 3.7%
 
Max length4
Mean length3.763607598
Min length2
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

acarbose
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101455
Steady
 
295
Up
 
10
ValueCountFrequency (%) 
No 101455 99.7%
 
Steady 295 0.3%
 
Up 10 < 0.1%
 
Down 3 < 0.1%
 
Max length6
Mean length2.011654531
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

acetohexamide
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101762
Steady
 
1
ValueCountFrequency (%) 
No 101762 > 99.9%
 
Steady 1 < 0.1%
 
Max length6
Mean length2.000039307
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

admission_source_id
Numeric

Distinct count17
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean5.75445889
Minimum1
Maximum25
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
Median7
Q37
95-th percentile17
Maximum25
Range24
Interquartile range6

Descriptive statistics

Standard deviation4.06410966
Coef of variation0.7062540089
Kurtosis1.744953515
Mean5.75445889
MAD2.976668374
Skewness1.029942076
Sum585591
Variance16.51698733
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=17)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 10.5 15.5 18.5 21. 25. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7 57492 56.5%
 
1 29564 29.1%
 
17 6781 6.7%
 
4 3187 3.1%
 
6 2264 2.2%
 
2 1104 1.1%
 
5 855 0.8%
 
3 187 0.2%
 
20 161 0.2%
 
9 125 0.1%
 
Other values (7) 43 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 29564 29.1%
 
2 1104 1.1%
 
3 187 0.2%
 
4 3187 3.1%
 
5 855 0.8%
 

Maximum 5 values

ValueCountFrequency (%) 
25 2 < 0.1%
 
22 12 < 0.1%
 
20 161 0.2%
 
17 6781 6.7%
 
14 2 < 0.1%
 

admission_type_id
Numeric

Distinct count8
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.024016588
Minimum1
Maximum8
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
Median1
Q33
95-th percentile6
Maximum8
Range7
Interquartile range2

Descriptive statistics

Standard deviation1.445413768
Coef of variation0.7141313846
Kurtosis1.942418805
Mean2.024016588
MAD1.095259261
Skewness1.591977215
Sum205970
Variance2.089220961
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=8)
Histogram
Histogram with variable size bins (bins=[1. 1.5 3.5 4.5 5.5 6.5 7.5 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 53988 53.1%
 
3 18868 18.5%
 
2 18480 18.2%
 
6 5291 5.2%
 
5 4785 4.7%
 
8 320 0.3%
 
7 21 < 0.1%
 
4 10 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 53988 53.1%
 
2 18480 18.2%
 
3 18868 18.5%
 
4 10 < 0.1%
 
5 4785 4.7%
 

Maximum 5 values

ValueCountFrequency (%) 
8 320 0.3%
 
7 21 < 0.1%
 
6 5291 5.2%
 
5 4785 4.7%
 
4 10 < 0.1%
 

age
Numeric

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean65.96685436
Minimum5
Maximum95
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum5
5-th percentile35
Q155
Median65
Q375
95-th percentile85
Maximum95
Range90
Interquartile range20

Descriptive statistics

Standard deviation15.94102208
Coef of variation0.2416519968
Kurtosis0.2813026283
Mean65.96685436
MAD12.654119
Skewness-0.630507472
Sum6712985
Variance254.1161848
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
75 26066 25.6%
 
65 22482 22.1%
 
55 17256 17.0%
 
85 17197 16.9%
 
45 9685 9.5%
 
35 3775 3.7%
 
95 2793 2.7%
 
25 1657 1.6%
 
15 691 0.7%
 
5 161 0.2%
 

Minimum 5 values

ValueCountFrequency (%) 
5 161 0.2%
 
15 691 0.7%
 
25 1657 1.6%
 
35 3775 3.7%
 
45 9685 9.5%
 

Maximum 5 values

ValueCountFrequency (%) 
95 2793 2.7%
 
85 17197 16.9%
 
75 26066 25.6%
 
65 22482 22.1%
 
55 17256 17.0%
 

change
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
54754
Ch
47009
ValueCountFrequency (%) 
No 54754 53.8%
 
Ch 47009 46.2%
 
Max length2
Mean length2
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

chlorpropamide
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101677
Steady
 
79
Up
 
6
ValueCountFrequency (%) 
No 101677 99.9%
 
Steady 79 0.1%
 
Up 6 < 0.1%
 
Down 1 < 0.1%
 
Max length6
Mean length2.003124908
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

citoglipton
Constant

This variable is constant and should be ignored for analysis

Constant valueNo

df_index
Numeric

Distinct count101763
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean50882.14641
Minimum0
Maximum101765
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile5088.1
Q125440.5
Median50882
Q376323.5
95-th percentile96676.9
Maximum101765
Range101765
Interquartile range50883

Descriptive statistics

Standard deviation29377.55192
Coef of variation0.5773646357
Kurtosis-1.1999904
Mean50882.14641
MAD25441.49596
Skewness2.123409245e-05
Sum5177919865
Variance863040557
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 101765.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
25894 1 < 0.1%
 
5416 1 < 0.1%
 
7465 1 < 0.1%
 
1322 1 < 0.1%
 
3371 1 < 0.1%
 
13612 1 < 0.1%
 
15661 1 < 0.1%
 
9518 1 < 0.1%
 
11567 1 < 0.1%
 
Other values (101753) 101753 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
101765 1 < 0.1%
 
101764 1 < 0.1%
 
101763 1 < 0.1%
 
101762 1 < 0.1%
 
101761 1 < 0.1%
 

diabetesmed
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Yes
78361
No
23402
ValueCountFrequency (%) 
Yes 78361 77.0%
 
No 23402 23.0%
 

diag_1
Categorical

Distinct count717
Unique (%)0.7%
Missing (%)< 0.1%
Missing (n)21
428
 
6862
414
 
6580
786
 
4016
Other values (713)
84284
ValueCountFrequency (%) 
428 6862 6.7%
 
414 6580 6.5%
 
786 4016 3.9%
 
410 3614 3.6%
 
486 3508 3.4%
 
427 2766 2.7%
 
491 2275 2.2%
 
715 2151 2.1%
 
682 2042 2.0%
 
434 2028 2.0%
 
Other values (706) 65900 64.8%
 
Max length6
Mean length3.17563358
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

diag_2
Categorical

Distinct count749
Unique (%)0.7%
Missing (%)0.4%
Missing (n)358
276
 
6752
428
 
6662
250
 
6071
Other values (745)
81920
ValueCountFrequency (%) 
276 6752 6.6%
 
428 6662 6.5%
 
250 6071 6.0%
 
427 5036 4.9%
 
401 3736 3.7%
 
496 3305 3.2%
 
599 3288 3.2%
 
403 2823 2.8%
 
414 2650 2.6%
 
411 2565 2.5%
 
Other values (738) 58517 57.5%
 
Max length6
Mean length3.173235852
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

diag_3
Categorical

Distinct count790
Unique (%)0.8%
Missing (%)1.4%
Missing (n)1423
250
 
11555
401
 
8288
276
 
5175
Other values (786)
75322
ValueCountFrequency (%) 
250 11555 11.4%
 
401 8288 8.1%
 
276 5175 5.1%
 
428 4577 4.5%
 
427 3955 3.9%
 
414 3664 3.6%
 
496 2605 2.6%
 
403 2357 2.3%
 
585 1992 2.0%
 
272 1969 1.9%
 
Other values (779) 54203 53.3%
 
Max length6
Mean length3.139618525
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

discharge_disposition_id
Numeric

Distinct count26
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.715515462
Minimum1
Maximum28
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
Median1
Q34
95-th percentile18
Maximum28
Range27
Interquartile range3

Descriptive statistics

Standard deviation5.279918511
Coef of variation1.421046034
Kurtosis6.004127661
Mean3.715515462
MAD3.48252011
Skewness2.563168617
Sum378102
Variance27.87753948
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=26)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 23.5 24.5 26. 27.5 28. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 60232 59.2%
 
3 13954 13.7%
 
6 12902 12.7%
 
18 3691 3.6%
 
2 2128 2.1%
 
22 1992 2.0%
 
11 1642 1.6%
 
5 1184 1.2%
 
25 989 1.0%
 
4 815 0.8%
 
Other values (16) 2234 2.2%
 

Minimum 5 values

ValueCountFrequency (%) 
1 60232 59.2%
 
2 2128 2.1%
 
3 13954 13.7%
 
4 815 0.8%
 
5 1184 1.2%
 

Maximum 5 values

ValueCountFrequency (%) 
28 139 0.1%
 
27 5 < 0.1%
 
25 989 1.0%
 
24 48 < 0.1%
 
23 412 0.4%
 

encounter_id
Highly correlated

This variable is highly correlated with df_index and should be ignored for analysis

Correlation0.9678103131

examide
Constant

This variable is constant and should be ignored for analysis

Constant valueNo

gender
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Female
54708
Male
47055
ValueCountFrequency (%) 
Female 54708 53.8%
 
Male 47055 46.2%
 
Max length6
Mean length5.075204151
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glimepiride
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
96572
Steady
 
4670
Up
 
327
ValueCountFrequency (%) 
No 96572 94.9%
 
Steady 4670 4.6%
 
Up 327 0.3%
 
Down 194 0.2%
 
Max length6
Mean length2.187376551
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glimepiridepioglitazone
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101762
Steady
 
1
ValueCountFrequency (%) 
No 101762 > 99.9%
 
Steady 1 < 0.1%
 
Max length6
Mean length2.000039307
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glipizide
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
89078
Steady
 
11355
Up
 
770
ValueCountFrequency (%) 
No 89078 87.5%
 
Steady 11355 11.2%
 
Up 770 0.8%
 
Down 560 0.6%
 
Max length6
Mean length2.457337146
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glipizidemetformin
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101750
Steady
 
13
ValueCountFrequency (%) 
No 101750 > 99.9%
 
Steady 13 < 0.1%
 
Max length6
Mean length2.000510991
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glyburide
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
91113
Steady
 
9274
Up
 
812
ValueCountFrequency (%) 
No 91113 89.5%
 
Steady 9274 9.1%
 
Up 812 0.8%
 
Down 564 0.6%
 
Max length6
Mean length2.375617857
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

glyburidemetformin
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101057
Steady
 
692
Up
 
8
ValueCountFrequency (%) 
No 101057 99.3%
 
Steady 692 0.7%
 
Up 8 < 0.1%
 
Down 6 < 0.1%
 
Max length6
Mean length2.027318377
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

insulin
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
47380
Steady
30849
Down
12218
ValueCountFrequency (%) 
No 47380 46.6%
 
Steady 30849 30.3%
 
Down 12218 12.0%
 
Up 11316 11.1%
 
Max length6
Mean length3.452708745
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

max_glu_serum
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
None
96417
Norm
 
2597
>200
 
1485
ValueCountFrequency (%) 
None 96417 94.7%
 
Norm 2597 2.6%
 
>200 1485 1.5%
 
>300 1264 1.2%
 
Max length4
Mean length4
Min length4
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

metformin
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
81776
Steady
18345
Up
 
1067
ValueCountFrequency (%) 
No 81776 80.4%
 
Steady 18345 18.0%
 
Up 1067 1.0%
 
Down 575 0.6%
 
Max length6
Mean length2.732388
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

metforminpioglitazone
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101762
Steady
 
1
ValueCountFrequency (%) 
No 101762 > 99.9%
 
Steady 1 < 0.1%
 
Max length6
Mean length2.000039307
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

metforminrosiglitazone
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101761
Steady
 
2
ValueCountFrequency (%) 
No 101761 > 99.9%
 
Steady 2 < 0.1%
 
Max length6
Mean length2.000078614
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

miglitol
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101725
Steady
 
31
Down
 
5
ValueCountFrequency (%) 
No 101725 > 99.9%
 
Steady 31 < 0.1%
 
Down 5 < 0.1%
 
Up 2 < 0.1%
 
Max length6
Mean length2.001316785
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

nateglinide
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101060
Steady
 
668
Up
 
24
ValueCountFrequency (%) 
No 101060 99.3%
 
Steady 668 0.7%
 
Up 24 < 0.1%
 
Down 11 < 0.1%
 
Max length6
Mean length2.026473276
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

num_lab_procedures
Numeric

Distinct count118
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean43.09590912
Minimum1
Maximum132
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile4
Q131
Median44
Q357
95-th percentile73
Maximum132
Range131
Interquartile range26

Descriptive statistics

Standard deviation19.67422016
Coef of variation0.4565217572
Kurtosis-0.2450422022
Mean43.09590912
MAD15.57377338
Skewness-0.2365305843
Sum4385569
Variance387.0749388
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 95.5 98.5 103.5 113.5 132. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 3208 3.2%
 
43 2804 2.8%
 
44 2496 2.5%
 
45 2376 2.3%
 
38 2212 2.2%
 
40 2201 2.2%
 
46 2189 2.2%
 
41 2117 2.1%
 
42 2113 2.1%
 
47 2106 2.1%
 
Other values (108) 77941 76.6%
 

Minimum 5 values

ValueCountFrequency (%) 
1 3208 3.2%
 
2 1101 1.1%
 
3 668 0.7%
 
4 378 0.4%
 
5 285 0.3%
 

Maximum 5 values

ValueCountFrequency (%) 
132 1 < 0.1%
 
129 1 < 0.1%
 
126 1 < 0.1%
 
121 1 < 0.1%
 
120 1 < 0.1%
 

num_medications
Numeric

Distinct count75
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean16.02183505
Minimum1
Maximum81
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile6
Q110
Median15
Q320
95-th percentile31
Maximum81
Range80
Interquartile range10

Descriptive statistics

Standard deviation8.127588707
Coef of variation0.5072820112
Kurtosis3.46825301
Mean16.02183505
MAD6.108464171
Skewness1.326715874
Sum1630430
Variance66.05769818
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 1. 2.5 3.5 4.5 5.5 ... 52.5 58.5 63.5 69.5 81. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
13 6086 6.0%
 
12 6004 5.9%
 
11 5795 5.7%
 
15 5792 5.7%
 
14 5707 5.6%
 
16 5430 5.3%
 
10 5346 5.3%
 
17 4919 4.8%
 
9 4913 4.8%
 
18 4523 4.4%
 
Other values (65) 47248 46.4%
 

Minimum 5 values

ValueCountFrequency (%) 
1 262 0.3%
 
2 470 0.5%
 
3 900 0.9%
 
4 1417 1.4%
 
5 2017 2.0%
 

Maximum 5 values

ValueCountFrequency (%) 
81 1 < 0.1%
 
79 1 < 0.1%
 
75 2 < 0.1%
 
74 1 < 0.1%
 
72 3 < 0.1%
 

num_procedures
Numeric

Distinct count7
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1.339691243
Minimum0
Maximum6
Zeros (%)45.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median1
Q32
95-th percentile5
Maximum6
Range6
Interquartile range2

Descriptive statistics

Standard deviation1.705791944
Coef of variation1.273272444
Kurtosis0.8572721945
Mean1.339691243
MAD1.366799563
Skewness1.316459598
Sum136331
Variance2.909726156
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
0 46652 45.8%
 
1 20741 20.4%
 
2 12716 12.5%
 
3 9443 9.3%
 
6 4954 4.9%
 
4 4180 4.1%
 
5 3077 3.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 46652 45.8%
 
1 20741 20.4%
 
2 12716 12.5%
 
3 9443 9.3%
 
4 4180 4.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6 4954 4.9%
 
5 3077 3.0%
 
4 4180 4.1%
 
3 9443 9.3%
 
2 12716 12.5%
 

number_diagnoses
Numeric

Distinct count16
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean7.422648703
Minimum1
Maximum16
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile4
Q16
Median8
Q39
95-th percentile9
Maximum16
Range15
Interquartile range3

Descriptive statistics

Standard deviation1.933577643
Coef of variation0.2604969897
Kurtosis-0.07888290487
Mean7.422648703
MAD1.668324587
Skewness-0.876799273
Sum755351
Variance3.738722502
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=16)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 5.5 8.5 9.5 15.5 16. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
9 49473 48.6%
 
5 11392 11.2%
 
8 10616 10.4%
 
7 10393 10.2%
 
6 10161 10.0%
 
4 5536 5.4%
 
3 2835 2.8%
 
2 1023 1.0%
 
1 219 0.2%
 
16 45 < 0.1%
 
Other values (6) 70 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 219 0.2%
 
2 1023 1.0%
 
3 2835 2.8%
 
4 5536 5.4%
 
5 11392 11.2%
 

Maximum 5 values

ValueCountFrequency (%) 
16 45 < 0.1%
 
15 10 < 0.1%
 
14 7 < 0.1%
 
13 16 < 0.1%
 
12 9 < 0.1%
 

number_emergency
Numeric

Distinct count33
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.1978420448
Minimum0
Maximum76
Zeros (%)88.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile1
Maximum76
Range76
Interquartile range0

Descriptive statistics

Standard deviation0.9304853637
Coef of variation4.703173003
Kurtosis1191.65412
Mean0.1978420448
MAD0.3514236806
Skewness22.85527244
Sum20133
Variance0.8658030121
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=33)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 7.5 11.5 13.5 23. 76. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 90380 88.8%
 
1 7677 7.5%
 
2 2042 2.0%
 
3 725 0.7%
 
4 374 0.4%
 
5 192 0.2%
 
6 94 0.1%
 
7 73 0.1%
 
8 50 < 0.1%
 
10 34 < 0.1%
 
Other values (23) 122 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 90380 88.8%
 
1 7677 7.5%
 
2 2042 2.0%
 
3 725 0.7%
 
4 374 0.4%
 

Maximum 5 values

ValueCountFrequency (%) 
76 1 < 0.1%
 
64 1 < 0.1%
 
63 1 < 0.1%
 
54 1 < 0.1%
 
46 1 < 0.1%
 

number_inpatient
Numeric

Distinct count21
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.6355846427
Minimum0
Maximum21
Zeros (%)66.5%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q31
95-th percentile3
Maximum21
Range21
Interquartile range1

Descriptive statistics

Standard deviation1.26287719
Coef of variation1.986953594
Kurtosis20.71883558
Mean0.6355846427
MAD0.8447605247
Skewness3.614085449
Sum64679
Variance1.594858797
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=21)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 9.5 11.5 13.5 16.5 21. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 67627 66.5%
 
1 19521 19.2%
 
2 7566 7.4%
 
3 3411 3.4%
 
4 1622 1.6%
 
5 812 0.8%
 
6 480 0.5%
 
7 268 0.3%
 
8 151 0.1%
 
9 111 0.1%
 
Other values (11) 194 0.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 67627 66.5%
 
1 19521 19.2%
 
2 7566 7.4%
 
3 3411 3.4%
 
4 1622 1.6%
 

Maximum 5 values

ValueCountFrequency (%) 
21 1 < 0.1%
 
19 2 < 0.1%
 
18 1 < 0.1%
 
17 1 < 0.1%
 
16 6 < 0.1%
 

number_outpatient
Numeric

Distinct count39
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.3693680414
Minimum0
Maximum42
Zeros (%)83.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile2
Maximum42
Range42
Interquartile range0

Descriptive statistics

Standard deviation1.267282189
Coef of variation3.430947043
Kurtosis147.9037399
Mean0.3693680414
MAD0.6172213546
Skewness8.832836868
Sum37588
Variance1.606004148
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=39)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 9.5 11.5 16.5 22.5 42. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 85024 83.6%
 
1 8547 8.4%
 
2 3594 3.5%
 
3 2042 2.0%
 
4 1099 1.1%
 
5 533 0.5%
 
6 303 0.3%
 
7 155 0.2%
 
8 98 0.1%
 
9 83 0.1%
 
Other values (29) 285 0.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 85024 83.6%
 
1 8547 8.4%
 
2 3594 3.5%
 
3 2042 2.0%
 
4 1099 1.1%
 

Maximum 5 values

ValueCountFrequency (%) 
42 1 < 0.1%
 
40 1 < 0.1%
 
39 1 < 0.1%
 
38 1 < 0.1%
 
37 1 < 0.1%
 

patient_nbr
Numeric

Distinct count71515
Unique (%)70.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean54329650.44
Minimum135
Maximum189502619
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum135
5-th percentile1456881.3
Q123412964.5
Median45500490
Q387545713.5
95-th percentile111480656.4
Maximum189502619
Range189502484
Interquartile range64132749

Descriptive statistics

Standard deviation38696580.05
Coef of variation0.7122552738
Kurtosis-0.3473394699
Mean54329650.44
MAD33216970.16
Skewness0.4713254916
Sum5.528748217e+12
Variance1.497425307e+15
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1.35000000e+02 1.30230000e+04 1.30815000e+04 1.03099500e+05 1.03167000e+05 ... 1.75829585e+08 1.80315072e+08 1.81693846e+08 1.84553753e+08 1.89502619e+08], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
88785891 40 < 0.1%
 
43140906 28 < 0.1%
 
88227540 23 < 0.1%
 
1660293 23 < 0.1%
 
23199021 23 < 0.1%
 
84428613 22 < 0.1%
 
23643405 22 < 0.1%
 
92709351 21 < 0.1%
 
23398488 20 < 0.1%
 
89472402 20 < 0.1%
 
Other values (71505) 101521 99.8%
 

Minimum 5 values

ValueCountFrequency (%) 
135 2 < 0.1%
 
378 1 < 0.1%
 
729 1 < 0.1%
 
774 1 < 0.1%
 
927 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
189502619 1 < 0.1%
 
189481478 1 < 0.1%
 
189445127 1 < 0.1%
 
189365864 1 < 0.1%
 
189351095 1 < 0.1%
 

pioglitazone
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
94436
Steady
 
6975
Up
 
234
ValueCountFrequency (%) 
No 94436 92.8%
 
Steady 6975 6.9%
 
Up 234 0.2%
 
Down 118 0.1%
 
Max length6
Mean length2.27648556
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

race
Categorical

Distinct count6
Unique (%)< 0.1%
Missing (%)2.2%
Missing (n)2271
Caucasian
76099
AfricanAmerican
19210
Hispanic
 
2037
Other values (2)
 
2146
(Missing)
 
2271
ValueCountFrequency (%) 
Caucasian 76099 74.8%
 
AfricanAmerican 19210 18.9%
 
Hispanic 2037 2.0%
 
Other 1505 1.5%
 
Asian 641 0.6%
 
(Missing) 2271 2.2%
 
Max length15
Mean length9.894362391
Min length3
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

readmitted
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
NO
54861
>30
35545
<30
11357
ValueCountFrequency (%) 
NO 54861 53.9%
 
>30 35545 34.9%
 
<30 11357 11.2%
 
Max length3
Mean length2.460894431
Min length2
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

repaglinide
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
100224
Steady
 
1384
Up
 
110
ValueCountFrequency (%) 
No 100224 98.5%
 
Steady 1384 1.4%
 
Up 110 0.1%
 
Down 45 < 0.1%
 
Max length6
Mean length2.05528532
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

rosiglitazone
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
95399
Steady
 
6099
Up
 
178
ValueCountFrequency (%) 
No 95399 93.7%
 
Steady 6099 6.0%
 
Up 178 0.2%
 
Down 87 0.1%
 
Max length6
Mean length2.241443354
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

time_in_hospital
Numeric

Distinct count14
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean4.396018199
Minimum1
Maximum14
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
Median4
Q36
95-th percentile11
Maximum14
Range13
Interquartile range4

Descriptive statistics

Standard deviation2.985092424
Coef of variation0.679044601
Kurtosis0.8503421103
Mean4.396018199
MAD2.35477941
Skewness1.134029798
Sum447352
Variance8.910776779
Memory size795.1 KiB
Histogram
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
3 17756 17.4%
 
2 17224 16.9%
 
1 14206 14.0%
 
4 13924 13.7%
 
5 9966 9.8%
 
6 7539 7.4%
 
7 5859 5.8%
 
8 4390 4.3%
 
9 3002 2.9%
 
10 2342 2.3%
 
Other values (4) 5555 5.5%
 

Minimum 5 values

ValueCountFrequency (%) 
1 14206 14.0%
 
2 17224 16.9%
 
3 17756 17.4%
 
4 13924 13.7%
 
5 9966 9.8%
 

Maximum 5 values

ValueCountFrequency (%) 
14 1042 1.0%
 
13 1210 1.2%
 
12 1448 1.4%
 
11 1855 1.8%
 
10 2342 2.3%
 

tolazamide
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101724
Steady
 
38
Up
 
1
ValueCountFrequency (%) 
No 101724 > 99.9%
 
Steady 38 < 0.1%
 
Up 1 < 0.1%
 
Max length6
Mean length2.001493667
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

tolbutamide
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101740
Steady
 
23
ValueCountFrequency (%) 
No 101740 > 99.9%
 
Steady 23 < 0.1%
 
Max length6
Mean length2.000904061
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

troglitazone
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
No
101760
Steady
 
3
ValueCountFrequency (%) 
No 101760 > 99.9%
 
Steady 3 < 0.1%
 
Max length6
Mean length2.000117921
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Correlations

Missing values

Sample

First rows

a1cresultacarboseacetohexamideadmission_source_idadmission_type_idagechangechlorpropamidecitogliptondf_indexdiabetesmeddiag_1diag_2diag_3discharge_disposition_idencounter_idexamidegenderglimepirideglimepiridepioglitazoneglipizideglipizidemetforminglyburideglyburidemetformininsulinmax_glu_serummetforminmetforminpioglitazonemetforminrosiglitazonemiglitolnateglinidenum_lab_proceduresnum_medicationsnum_proceduresnumber_diagnosesnumber_emergencynumber_inpatientnumber_outpatientpatient_nbrpioglitazoneracereadmittedrepagliniderosiglitazonetime_in_hospitaltolazamidetolbutamidetroglitazone
0NoneNoNo165NoNoNo0No250.83NaNNaN252278392NoFemaleNoNoNoNoNoNoNoNoneNoNoNoNoNo411010008222157NoCaucasianNONoNo1NoNoNo
1NoneNoNo7115ChNoNo1Yes276250.012551149190NoFemaleNoNoNoNoNoNoUpNoneNoNoNoNoNo59180900055629189NoCaucasian>30NoNo3NoNoNo
2NoneNoNo7125NoNoNo2Yes648250V27164410NoFemaleNoNoSteadyNoNoNoNoNoneNoNoNoNoNo11135601286047875NoAfricanAmericanNONoNo2NoNoNo
3NoneNoNo7135ChNoNo3Yes8250.434031500364NoMaleNoNoNoNoNoNoUpNoneNoNoNoNoNo44161700082442376NoCaucasianNONoNo2NoNoNo
4NoneNoNo7145ChNoNo4Yes197157250116680NoMaleNoNoSteadyNoNoNoSteadyNoneNoNoNoNoNo5180500042519267NoCaucasianNONoNo1NoNoNo
5NoneNoNo2255NoNoNo5Yes414411250135754NoMaleNoNoNoNoNoNoSteadyNoneNoNoNoNoNo31166900082637451NoCaucasian>30NoNo3NoNoNo
6NoneNoNo2365ChNoNo6Yes414411V45155842NoMaleSteadyNoNoNoNoNoSteadyNoneSteadyNoNoNoNo70211700084259809NoCaucasianNONoNo4NoNoNo
7NoneNoNo7175NoNoNo7Yes428492250163768NoMaleNoNoNoNoSteadyNoNoNoneNoNoNoNoNo731208000114882984NoCaucasian>30NoNo5NoNoNo
8NoneNoNo4285ChNoNo8Yes39842738112522NoFemaleNoNoSteadyNoNoNoSteadyNoneNoNoNoNoNo68282800048330783NoCaucasianNONoNo13NoNoNo
9NoneNoNo4395ChNoNo9Yes434198486315738NoFemaleNoNoNoNoNoNoSteadyNoneNoNoNoNoNo33183800063555939NoCaucasianNONoSteady12NoNoNo

Last rows

a1cresultacarboseacetohexamideadmission_source_idadmission_type_idagechangechlorpropamidecitogliptondf_indexdiabetesmeddiag_1diag_2diag_3discharge_disposition_idencounter_idexamidegenderglimepirideglimepiridepioglitazoneglipizideglipizidemetforminglyburideglyburidemetformininsulinmax_glu_serummetforminmetforminpioglitazonemetforminrosiglitazonemiglitolnateglinidenum_lab_proceduresnum_medicationsnum_proceduresnumber_diagnosesnumber_emergencynumber_inpatientnumber_outpatientpatient_nbrpioglitazoneracereadmittedrepagliniderosiglitazonetime_in_hospitaltolazamidetolbutamidetroglitazone
101753NoneNoNo7165NoNoNo101756Yes9965854031443842070NoFemaleNoNoNoNoNoNoSteadyNoneNoNoNoNoNo461769111140199494NoOther>30NoNo2NoNoNo
101754NoneNoNo7175NoNoNo101757Yes4915185111443842136NoFemaleNoNoNoNoNoNoSteadyNoneNoNoNoNoNo211619010181593374NoCaucasianNONoNo5NoNoNo
101755NoneNoNo7185ChNoNo101758Yes29283041443842340NoFemaleNoNoNoNoNoNoUpNoneNoNoNoNoNo762219100120975314NoCaucasianNONoNo5NoNoNo
101756NoneNoNo7185ChNoNo101759Yes4357842501443842778NoMaleNoNoNoNoNoNoUpNoneNoNoNoNoNo1150700386472243NoCaucasianNONoNo1NoNoNo
101757NoneNoNo7165ChNoNo101760Yes3454384121443847176NoFemaleNoNoNoNoNoNoDownNoneNoNoNoNoNo45251912350375628NoAfricanAmerican>30NoSteady6NoNoNo
101758>8NoNo7175ChNoNo101761Yes250.132914583443847548NoMaleNoNoNoNoNoNoDownNoneSteadyNoNoNoNo511609000100162476NoAfricanAmerican>30NoNo3NoNoNo
101759NoneNoNo5185NoNoNo101762Yes5602767874443847782NoFemaleNoNoNoNoNoNoSteadyNoneNoNoNoNoNo33183901074694222NoAfricanAmericanNONoNo5NoNoNo
101760NoneNoNo7175ChNoNo101763Yes385902961443854148NoMaleNoNoNoNoNoNoDownNoneSteadyNoNoNoNo53901300141088789NoCaucasianNONoNo1NoNoNo
101761NoneNoNo7285ChNoNo101764Yes9962859983443857166NoFemaleNoNoSteadyNoNoNoUpNoneNoNoNoNoNo45212901031693671SteadyCaucasianNONoNo10NoNoNo
101762NoneNoNo7175NoNoNo101765No5305307871443867222NoMaleNoNoNoNoNoNoNoNoneNoNoNoNoNo13339000175429310NoCaucasianNONoNo6NoNoNo